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Abstract. Well characterized sequences of dynamical states play an important role 
for motor control and associative neural computation in the brain. Autonomous 
dynamics involving sequences of transiently stable states have been termed associative 
latching in the context of grammar generation. We propose that generating functionals 
allow for a systematic construction of dynamical networks with well characterized 
dynamical behavior, such as regular or intermittent bursting latching dynamics. 

Coupling local, slowly adapting variables to an attractor network allows to 
destabilize all attractors, turning them into attractor ruins. The resulting attractor 
relict network may show ongoing autonomous latching dynamics. We propose to 
use two generating functionals for the construction of attractor relict networks. The 
first functional is a simple Hopheld energy functional, known to generate a neural 
attractor network. The second generating functional, which we denote polyhomeostatic 
optimization, is based on information-theoretical principles, encoding the information 
content of the neural firing statistics. Polyhomeostatic optimization destabilizes the 
attractors of the Hopfield network inducing latching dynamics. 

We investigate the influence of stress, in terms of conflicting optimization targets, on 
the resulting dynamics. Objective function stress is absent when the target level for the 
mean of neural activities is identical for the two generating functionals and the resulting 
latching dynamics is then found to be regular. Objective function stress is present 
when the respective target activity levels differ, inducing intermittent bursting latching 
dynamics. We propose that generating functionals may be useful quite generally for 
the controlled construction of complex dynamical systems. 
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1. Introduction 

The use of objective functions for the formulation of complex systems has seen a 
study surge of interest. Objective functions, in particular objective functions based on 
information theoretical principles [D [21 HH El El El E] , are used increasingly as generating 
functionals for the construction of complex dynamical and cognitive systems. There 
is then no need to formulate by hand equations of motion, just as it is possible, 
in analogy, to generate in classical mechanics Newton's equation of motion from an 
appropriate Lagrange function. When studying dynamical systems generated from 
objective functions encoding general principles, one may expect to obtain a deeper 
understanding of the resulting behavior. The kind of generating functional employed 
also serves, in addition, to characterize the class of dynamical systems for which the 
results obtained may be generically valid. 

Here we study the interplay between two generating functionals. The first 
generating functional is a simple energy functional. Minimizing this objective function 
one generates a neural network with predefined point attractors, the Hopfield net 
[HI [in]- The second generating functional describes the information content of the 
individual neural firing rates. Minimizing this functional results in maximizing the 
information entropy [61 [7] and in the generation of adaption rules for the intrinsic 
neural parameters, the threshold and the gain. This principle has been denoted 
polyhomeostatic optimization [3 [11], as it involves the optimization of an entire function, 
the distribution function of the time-averaged neural activities. 

We show that polyhomeostatic optimization destabilizes all attractors of the 
Hopfield net, turning them into attractor ruins. The resulting dynamical network is an 
attractor relict network and the dynamics involves sequences of continuously latching 
transient states. This dynamical state is characterized by trajectories slowing down close 
to the succession of attractor ruins visited consecutively. The two generating functionals 
can have incompatible objectives. Each generating functional, on its own, would lead 
to dynamical states with certain average levels of activity. Stress is induced when these 
two target mean activity levels differ. We find that the system responds to objective 
function stress by resorting to intermittent bursting, with laminar flow interseeded by 
burst of transient state latching. 

Dynamical systems functionally equivalent to attractor relict networks have been 
used widely to formulate dynamics involving on-going sequences of transient states. 
Latching dynamics has been studied in the context of the grammar generation with 
infinite recursion [T21 H3] and in the context of reliable sequence generation [TJ1 [To] . 
Transient state latching has also been observed in the brain [161 E] and may constitute 
an important component of the internal brain dynamics. This internal brain dynamics 
is autonomous, ongoing being modulated, but not driven, by the sensory input [HI [19] . 
In this context an attractor relict network has been used to model autonomous neural 
dynamics in terms of sequences of alternating neural firing patterns [20] • The modulation 
of this type of internal latching dynamics by a stream of sensory inputs results, via 
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unsupervised learning, in a mapping of objects present in the sensory input stream to 
the preexisting attractor ruins of the attractor relict network [211 [22] . the associative 
latching dynamics thus acquiring semantic content. 



2. Generating functionals 

We consider here N rate encoding neurons in continuous time, with a non-linear transfer 
function g(x), 

V% = 9i{xi) = 1 - eai{bi _ Xi) , (!) 

where Xi(t) G 1Z is the membrane potential and Ui{t) G [0, 1] the firing rate of the ith 
neuron. The dynamics of the neural activity is 

N 

±i = -Txi + ^ w iiVi ' ( 2 ) 

3=1 

which describes a leaky integrator, with T being the leak rate and the Wij the inter- 
neural synaptic weights. This kind of dynamics can be derived minimizing a simple 
energy functional] 

ij i 

with respect to the membrane potential Xi [9] [10] . When using an energy functional to 
derive ([2]), the resulting synaptic weights are necessarily symmetrized, viz Wy = Wji. 
Here we do indeed consider only symmetric synaptic links, the dynamics of a network 
of leaky integrators would however remain well behaved when generalizing to non- 
symmetric weight matrices. 

The transfer function ([T]) contains two intrinsic parameters, a and b, which can be 
either set by hand or autonomously adapted, an approach considered here. The intrinsic 
parameters determine the shape of the individual firing-rated distributions Pi(y), given 
generically by 



1 



r 



vM = fj o 8{v-Vi(t-T))dT, (4) 

where S(-) is the Dirac 5-function, and T — > oo the observation time-interval. The 
Kullback-Leibler divergence [23] 

DKh(Pi,q) = [ dyp;(y)log^y (5) 

Jo q{y) 

measures the distance between the neural firing rate distribution Pi(y) and a normalized 
distribution q(y), with D KL > generically and D KL = only for Pi(y) = q(y). Gaussian 
distributions 

q(y) oc exp(Aiy + X 2 y 2 ) (6) 



f Strictly speaking, minimizing (j3J), one obtains xi — —Txi + aj(l — Di)yi Y) j w ijUj an< i n °t |2|- It is 
however custom to neglect the extra term aj(l — Vi)Vi- The term oc T in ([3]) restricts runaway growth 
of the membrane potentials. 
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ai = e a (l/ai + (xi-b i )[l-2y i + (Xi + 2X 2 yi){l-yi)yi]] 
bi = e b ( - ^ [1 - 2y { + (Ai + 2X 2 yi) (1 - yi) yi 



maximize the information entropy whenever mean and standard deviation are given [23J . 
Minimizing the Kullback-Leibler divergence ^ with respect to the individual intrinsic 
parameters and bi is hence equivalent to optimizing the information content of the 
neural activity [6]. Using variational calculus one obtains [TTJ [21] 

— vA I 

(7) 

where the e a and e b are adaption rates. When these adaption rates are small, with respect 
to the time scale of the cognitive dynamics ([2]), an implicit time average is performed 
[TT1 124] . compare Q, and the respective firing rate distribution Pi(y) approaches the 
target distribution function q(y). The adaption rules ^ implement the optimization 
of an entire distribution function, and not of a single scalar quantity, and are hence 
equivalent to a polyhomeostatic optimization process [7]. 

The original network ^ has point attractors, as given by Xi = 0, Wi. These 
attractors are destabilized for any e a , e b > 0, the coupled system ^ and ^ has no 
stationary points, as defined by x = = a = b. Stationary point attractors lead to 
neural firing statistics far from the target distribution ^ and to a large Kullback- 
Leibler divergence D KL , and hence to finite stochastic gradients ([7]). The principle of 
polyhomeostatic adaption is hence intrinsically destabilizing. 

In deriving the stochastic adaption rules ^ one rewrites the Kullback-Leibler 
divergence -Dkl as an integral over the distribution Pi(x) of the respective membrane 
potential Xi, namely as Dkl = J dxpi(x)d(x), with an appropriate kernel d(x). The 
optimal overall adaption rules depend on the specific shape of Pi(x). The optimal 
adaption rates not dependent on the distribution of the membrane potential are, on the 
other side, given by minimizing the kernel, a oc dd(x)/da and b oc dd(x)/db respectively, 
which leads to the adaption rates ([7]), which are instantaneous in time [H HU |2"4"] . 

3. Attractor relicts in a three site network 

Starting, we consider now one of the simplest possible networks having two distinct 
attractors in terms of minima of the energy functional The dynamics of the 3-site 
network illustrated in Fig. [T] is given by 
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with w + > and w~ > denoting the excitatory and inhibitory link strength 
respectively. For fixed intrinsic parameters a and b, viz for vanishing adaption rates 
e a , e b = 0, the network has two possible phases, as shown in Fig. [TJ There is either a 
single global attractor, with activities yi determined mostly by the value of the threshold 
b, or two stable attractors, with either yi or y 3 being small and the other two firing rates 



large. We will discuss the phase diagram in detail in Sect. 3.1 
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Figure 1. Left: A three-site graph with symi&etric excitatory (solid green lines) and 
inhibitory connections (red dashed lines). Right: The corresponding phase diagram, 
for fixed T = 1 and w + = 1 = w~ , as function of the gain a = a t and threshold 
b = hi, for i = 1,2,3. For large/small thresholds the sites tend to be inactive/active. 
The color encodes the activity of the most active neuron. Inside the shaded area 
there are two stable attractors, outside a single, globally attracting state. The binary 
representations of the two non-trivial attractors are (1,1,0) and (0,1,1), where 1/0 
denotes an active/inactive site. The lower and upper lines (blue/white) of the phase 
transition are of second and first order respectively, with the lower phase transition 
line determiend by 1 — ay(l — y) where y = y\ = 7/3 is the activity of the single fixpoint 
at site one and three. The two critcal lines meet at (a c ,b c ) — (4,0.413) (blue dot). 



The point attractors present for vanishing intrinsic adaption become unstable for 
finite adaption rates e a , > 0. Point attractors, defined by = = jji, have 
time- independent neural activities. The objective (|5]), to attain a minimal Kullback- 
Leibler divergence, can however not be achieved for constant firing rates yi. The 
polyhomeostatic adaption ^ hence forces the system to become autonomously active, 
as seen in Fig. [2j We denote with 



e = xw, \\e\ 



\ 



N 



E(£f) 2 ( 9 ) 



the binary activity patterns of the attractors for the N = 3 network, = (1,1,0) 
and £ 2 = (0, 1, 1). In the following we want to investigate to which degree the original 
binary attractors are retraced. For this propose we define two criteria, the first being 
the overlap O p G [0, 1], 

o P = J^, (e,y) = f:^, (io) 

which corresponds to the cosine of the angle between the actual neural activity 
(yi, . . . ,i/n) and the attractor state £ p . As a second measure, of how close the actual 
activity pattern and the original attractors are, we consider the reweighted scalar 
product A p G [0, 1] 

A P = (e,y)/zli$ ■ (ii) 

Note that the £f are positive, representing the neural firing rate in attractor states. 
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Figure 2. Time series of the three-site network shown in Fig. [TJ for T = 1, e a = 0.1, 
e b = 0.01, Ai = 0, A 2 = and w ± = 1. The dynamics is given by Q and Q. The 
dynamics retraces periodically the original attractor states, as one can see from the 



oscillation of the overlap O p and the A p , compare Eqs. (10) and ( |ll| , between the 
patterns of neural and attractor activities. 



The data for O p and A p presented in Fig. [2] shows that the attractors become 
unstable in the presence of finite polyhomeostatic adaption, retaining however a 
prominent role in phase space. Unstable attractors, transiently attracting the phase 
space flow, can be considered to act as 'attractor relicts'. The resulting dynamical system 
is hence an attractor relict network, viz a network of coupled attractor ruins [201 EI] ■ 
When coupling an attractor network to slow intrinsic variables, here the threshold b 
and the gain a, the attractors are destroyed but retain presence in the flow in terms of 
attractor ruins, with the dynamics slowing down close to the attractor relict. 

The concept of an attractor ruin is quite general and attractor relict networks have 
implicitly been generated in the past using a range of functionally equivalent schemes. It 
is possible to introduce additional local slow variables, like a reservoir, which are slowly 
depleted when a unit is active [201 121] . Similarly, slowly accumulating spin variables 
have been coupled to local dynamic thresholds in order to destabilize attractors [14], 
and local, slowly adapting, zero state fields have been considered in the context of 
latching Potts attractor networks jT2j, [131 [2HI [27] . Alternatively, attractors have been 
destabilized in neural networks by introducing slowly adapting asymmetric components 
to the inter-neural synaptic weights [13] . 



Generating junctionals for latching dynamics 



7 




2 4 6 8 

Figure 3. The trajectories or (aj(i), 6j(t)) for finite adaption rates, e a — 0.1 and 
£h = 0.01, superimposed onto the phase diagram of the three-site network shown in 
Fig. [T] The color encodes the activity of the most active site. Inside the shaded 
blue area two attractors are stable, outside there is only a single global attractor. 
The parameters are the same as for the simulation presented in Fig. [2] The thresholds 
&i(t) and b 3 (t) (red/blue trajectories) oscillate across the second-order phase boundary 
(blue line), the threshold b 2 (t) (green trajectory) acquires a large value, receiving two 
excitatory inputs, and avoids the first-order phase boundary (white line) during the 
initial transient. All three trajectories start in the center, lower-half of the phase 
diagram, at a = 5, b = —0.5 (black dot). Note that the underlying phase diagram is 
for identical thresholds bi and gains a^. 

3.1. Phase boundary adaption 

In Fig. [3] we have superimposed, for the three-site network, a typical set of trajectories, 
with finite adaption rates e a = 0.1 and tb = 0.01, onto the phase diagram evaluated 
for non-adapting neurons, with e a = = e^, as shown in Fig. [Tj The three trajectories 
(green: (a 2 ,& 2 ), red and blue: (ai,&i) and (a 3 ,5 3 )) all start well in the region with a 
single global fixpoint, at = 5, hi = —0.5 (i = 1,2,3). Two features of the flow are 
eye-catching. 

• The intrinsic parameters of the active neurons settle, after a transient initial period, 
at the phase boundary and oscillate alternating between the phase with a single 
stable fixpoint and the phase with two attractors. 

• The trajectory (a 2 , & 2 ) for the central neuron adapts to a large value for the 
threshold, taking care not to cross any phase boundary during the initial transient 
trajectory 

We first discuss the physics behind the second phenomenon. The Euclidean distance 
in phase space between the two attractors, when existing, can be used as an order 
parameter. In Fig. [4] the order parameter is given for vertical cuts through the phase 
diagram. The lower transition is evidently of second order, which can be confirmed by 
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Figure 4. Vertical cuts (for constant gains a = 4,0, 4.3, 4.7, 5.0, 6.0, bottom to 
top curves) through the phase diagram of the 3-site graph shown in Fig. [I] Plotted is 

the order parameter )Jy^=i ~ 2/; 2 ') > where y^ — {yi l \y2 l \y^) are the two 
fixpoint solutions (7= 1,2) in the coexistent region. The lower/upper transitions are 
second/first order. 




a stability analysis of ([8]), with the upper transition being of first order, in agreement 
with a graphical analysis of ([8]). A first order transition will not be crossed, in general, 
by an adaptive process, as there are no gradients for the adaption to follow. This is the 
reason that the trajectory for (02, b 2 ) shown in Fig. [3] needs to take a large detour in 
order to arrive to its target spot in phase space. This behavior is independent of the 
choice of initial conditions. 

Using an elementary stability analysis one can show that the condition for the 
second-order line is 1 = ay(l — y), where y = y\ — y$ (for the single global 
fixpoint). Only a single global attractor is consequently stable for a < 4 (since 
y G [0, 1] and y{l — y) < 0.25) and the second and the first order lines meet at 
(a c , b c ) = (4,0.413), where the critcal threshold b c is determined by the self-consistent 
solution of b c = l/[l + exp(4fe c — 4)] — 1/2. The trajectories (ai,&i) and (03,63) are hence 
oscillating across the locus in phase space where there would be a second-order phase 
transition, compare Fig. |3j for the case of identical internal parameters and 6«. With 
adaption, however, the respective internal parameters take distinct values, (a, b ± 5b) for 
the first /third neuron and (a, 62) for the second neuron, with a ~ 6, b ~ and hi ~ 1. In 
fact the system adapts the intrinsic parameters dynamical in a way that a non-stopping 
sequence of states 

... -> e -> e e -> £° -> e 1 -> ••• (12) 

is visited, where = (1,1,0), £ 2 = (0,1,1) denote the two stable cliques and 
£° = (1,1,1) the fixpoint in the phase having only a single global attractor. It is 
hence not a coincidence that the system adapts autonomously, as shown in Fig. [3j to a 
region in phase space where there would be a second order phase transition for identical 
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Figure 5. Left: Example of Np = 7 random binary patterns £ p , compare (15), with 
sparseness a = 0.3 for a network of N — 100 neurons. Right: Respective pairwise 
mutual pattern overlaps Patterns maximally overlap with themselves, the 

inter-pattern overlap is random and small. 



internal parameters. At this point, small adaptive variations make the sequence (12) 
easily achievable. The adaption process hence shares some aspects with what one calls 
'self-organized quasi-criticality' in the context of absorbing phase transitions [8]. 



4. Latching in Hopfield networks with objective function stress 

We will now investigate networks of larger size N. In principle we could generate 
random synaptic link matrices and find their eigenstates £ p = (£f, . . . , using 

a diagonalization routine. Alternatively we can use the Hopfield encoding J9j [10] 

1 Np N P 



Wij oc 



(13) 



p=i p=i 

for the synaptic weights w^, where £j are the arithmetic means of all pattern activities, 
for the respective sites. Here N p is the number of encoded attractors and a G [0, 1] the 
mean pattern activity, 



a 



N ^ 



(14) 



When using the Hopfield encoding (13), the attractors are known to correspond, to a 
close degree, to the stored patterns £ p , as long as the number N p of patterns is not too 
large [91 [10]. We here make use of the Hopfield encoding for convenience, no claim is 



made that memories in the brain are actually stored and defined by (13). 

We consider in the following random binary patterns, as illustrated in Fig. [5j 

1 with probability a 
with probability 1 — a 



(15) 



where a is the mean activity level or sparseness. The patterns £ p have in general a 
finite, albeit small, overlap, as illustrated in Fig. [5} 
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Figure 6. Overlaps O p and A p (color encoded, see Eqs. ( 10 ) and ( 11 )), for a N = 100 



network, of the neural activities with the N p — 7 attractor ruins, as a function of 
time t. The sparseness of the binary patterns defining the attractor ruins is a = 0.3. 
Here the average activity a of the attractor relicts and the target mean activity level 
/i = 0.3 have been selected to be equal, resulting in a limiting cycle with clean latching 
dynamics. 



4-1. Regular latching dynamics in the absence of stress 

The target distribution q(y) for the intrinsic adaption has an expected mean 



dyyg(y) 



+ 



(16) 



a 2 =o Ai ' e Al — 1 

which can be evaluated noting that the support of q(y) is [0, 1]. The target mean activity 
fi can now differ from the average activity of an attractor relict, the mean pattern 



activity a, compare (14). The difference between the two quantities, viz between the 



two objectives, energy minimization vs. polyhomeostatic optimization, induces stress 
into the latching dynamics, which we will study in the following. 

We simulated Eqs. ^ and ^ using 4th order classical Runge-Kutta [25] and a 
timestep of At = 0.1. The resulting dynamics is dependent on the magnitude of the 
adaption rates for the gain a and for the threshold b. In general latching dynamics is 
favored for small e& ~ 0.01 and somewhat larger e a ~ 0.01 ... 1. We kept a constant leak 
rate F — 1. The results presented in Figs. [6] and [7] have been obtained setting A2 = 
and using Table 1 for Ai 



Table 1. Relation between the parameter Ai, for A2 = 0, and the mean value fi, as 



given by Eq. 16 for the target distribution q(y). 
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Figure 7. Time evolution, for a selection of 20 out of N — 100 neurons, of the 
membrane potential Xj, gain dj, threshold 6j and firing rate yi for the time series of 
overlaps presented in Fig. [6] 

In Fig. [6] we present the time evolution of the overlaps A p and O p for a = 0.3 = /i 
and adaption rates e a = 0.1, e& = 0.01. In this case the target mean activity, /i, of the 
intrinsic adaption rules ^ is consistent with the mean activity a of the stored patterns 
£ p , viz with the mean activity level of the attractor relicts. One observes that the 
system settles into a limiting cycle with all seven stored patterns becoming successively 
transiently active, a near-to-perfect latching dynamics. The dynamics is very stable and 
independent of initial conditions, which were selected randomly. 

In Fig. [7] we present the time evolution, for 20 out of the N = 100 neurons, of the 
respective individual variables. The simulation parameters are identical for Figs. [6] and 
[7| Shown in Fig. [7] are the individual membrane potentials Xi(t), the firing rates yi(t), 
the gains aiit) and the thresholds bi(t). The latching activation of the attractor relicts 
seen in Fig. [6] reflects in corresponding transient activations of the respective membrane 
potentials and firing rates. The oscillations in the thresholds bi(t) drive the latching 
dynamics, interestingly, even though the adaption rate is larger for the gain. 

The synaptic weights Wy are symmetric and consequently also the overlap matrix 
presented in Fig. [5| The latching transitions evident in Fig. [6] are hence spontaneous 
in the sense that they are not induced by asymmetries in the weight matrix [12]. We 
have selected uncorrelated patterns £ p and the chronological order of the transient states 
is hence determined by small stochastic differences in the pattern overlaps. It would 
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Figure 8. Time evolution of overlaps O v and A p (color coded, compare Eqs. (10) and 



( 11 )) for all Np — 7 binary patterns, with sparseness a = 0.3, for a network of TV = 100 
neurons. The target mean neural activity is /i — 0.15, the difference between the two 
objective functions, viz between fi and a induces stress. The intermittent latching 
dynamics has a mean activity of about 0.3, which is too large. The phases of laminar 
flows between the burst of latching has a reduced average activity, thus reducing the 
time-averaged mean activity level toward the target fi — 0.15. 



however be possible to consider correlated activity patters £ p incorporating a rudimental 
grammatical structure [26], [27], which is however beyond the scope of the present study 

4-2. Stress-induced intermittent bursting latching 

In Fig. [8] we present the time evolution of the overlaps A p and O p for the case when the 
two objective functions, the energy functional and the polyhomeostatic optimization, 
incorporate conflicting targets. We retain the average sparseness a = 0.3 for the stored 
patterns, as for Fig. [6j but reduced the target mean firing rate to fi = 0.15. This 
discrepancy between a and /i induces stress into the dynamics. The pure latching 
dynamics, as previously observed in Fig. [6j corresponds to a mean activity of about 0.3, 
in conflict with the target value \x = 0.15. 

Phase of laminar flow are induced by the objective function stress, and the latching 
dynamics occurs now in the form of intermittent bursts. The neural activity, see A p 
in Fig. [8j is substantially reduced during the laminar flow and the time averaged mean 
firing rate such reduced towards the target activity level of /i = 0.15. The trajectory 
does not come close to any particular attractor relict during the laminar flow, the overall 
O p being close to 0.5 for all N p = 7 stored pattern. 

The time evolution is hence segmented into two distinct phases, a slow laminar 
phase far from any attractor and intermittent phases of bursting activity in which the 
trajectories linger transiently close to attractor relicts with comparatively fast (relative 
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Figure 9. Time evolution, for N = 1000, of the overlaps O p for all Np — 20 binary 
patterns (vertically displaced) with sparseness a — 0.2 and a target mean activity of 
fi = 0.2. There is no objective-function stress and the latching is regular, the adaption 
rates are e a — 0.1, = 0.01. 

to the time scale of the laminar phase) latching transitions. Intermittent bursting 
dynamics has been observed previously in polyhomeostatically adapting neural networks 
with randomly selected synaptic weights [HE], the underlying causes had however not 
been clear. Using the concept of competing generating functionals we find that objective 
function stress is the underlying force driving the system into the intermittent bursting 
regime. 

4-3. Robustness of transient state dynamics 

The latching dynamics presented in Figs. [6] and [8] is robust with respect to system 
size N. We did run the simulation for different sizes of networks with up to iV = 10 5 
neurons, a series of sparseness parameters a and number of stored patterns Np. As an 
example we present in Fig. [9] the overlap O p for N = 1000 neurons and Np = 20 binary 
patterns with a sparseness of a = 0.2. No stress is present, the target mean activity 
level is \i — 0.2. The latching dynamics is regular, no intermittent bursting is observed. 
There is no constraint, generically, to force the limiting cycle to incorporate all attractor 
relicts. Indeed, for the transient state dynamics presented in Fig. [9j some of the stored 
patterns are never activated. 

The data presented in Figs. [6j [8] and [9] is for small numbers of attractor relicts 
N p , relative to the systems size N, a systematic study for large values of N p is beyond 
the scope of the present study. Latching dynamics tends to break down, generically 
speaking, when the overlap between distinct attractors, as shown in Fig. [5j becomes 
large. The autonomous dynamics then becomes irregular. 
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5. Discussion 

The use of generation functionals has a long tradition in physics in general 
and in classical mechanics in particular. Here we point out that using several 
multivariate objective functions may lead to novel dynamical behaviors and an improved 
understanding of complex systems in general. We propose in particular to employ 
generating functionals which are multivariate in the sense that they are used to derive 
the equations of motion for distinct, non-overlapping subsets of dynamical variables. In 
the present work we have studied a neural network with fast primary variables Xi(t), 
the membrane potentials, and slow secondary variables Oi(t) and bi(t), characterizing 
the internal behavior of individual neurons, here the gains Oj and the thresholds hi. The 
time evolution of these sets of interdependent variables is determined respectively by 
two generating functionals: 

ENERGY FUNCTIONAL 

Minimizing an energy functional generates the equation of motion (leaky integrator) 
for the primary dynamical variables Xi(t), the individual membrane potentials. 

INFORMATION THEORETICAL FUNCTIONAL 

Minimizing the Kullback-Leibler divergence between the distribution Pi(y) of the 
time-average neural firing rate yi and a target distribution function q(y) maximizing 
information entropy generates intrinsic adaption rules a and b for the gain a and 
the threshold b (polyhomeostatic optimization). 

Generating functionals may incorporate certain targets or constraints, either explicitly 
or implicitly. We denote the interplay between distinct objectives incorporated by 
competing generating functionals "objective functions stress". For the two generating 
functionals considered in this study there are two types of objective functions stress: 

FUNCTIONAL STRESS 

The minima of the energy functional are time-independent point attractors leading 
to firing-rate distributions Pi(y) which are sharply peaked. The target firing- 
rated distribution q(y) for the information-theoretical functional is however smooth 
(polyhomeostasis). This functional stress leads to the formation of an attractor 
relict network. 

SCALAR STRESS 

The mean target neural firing rate /i is a (scalar) parameter for the target firing-rate 
distribution q(y), and hence encoded explicitly within the information theoretical 
functional. The local minima of the energy functional, determined by the synaptic 
weights Wij, determine implicitly the mean activity levels a of the corresponding 
point attractors. Scalar objective function stress is present for a ^ \i. 

For the two generating functionals considered, we find that the scalar objective function 
stress induces a novel dynamical state, characterized by periods of slow laminar flow 
interseeded by bursts of rapid latching transitions. We propose that objective function 
stress is a powerful tool, in general, for controlling the behavior of complex dynamical 
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systems. The interplay between distinct objective functions may hence serve as a 
mechanism for guiding self organization [281 12H1 [30] . 
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